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Poccuiicxas Dexepayua 


Introduction. Algorithms for the parallel binary tree construc- 
tion are developed. The algorithms are based on sorting and 


N 


described in a constructive form. For the *’ element set, the 


T(R)=O0) ang 7 (R) = Olog, N) 


time complexity has esti- 


2 
mates, where ENO es 


is the number of processors. 
The tree is built with the uniqueness property. The algorithms 
are invariant with respect to the input sequence type. The work 
objective is to develop and study ways of accelerating the 
process of organizing and transforming the tree-like data struc- 
tures on the basis of the stable maximum parallel sorting algo- 
rithms for their application to the basic operations of infor- 
mation retrieval on databases. 

Materials and Methods. A one-to-one relation between the 
input element set and the binary tree built for it is established 
using a stable address sorting. The sorting provides maximum 
concurrency, and, in an operator form, establishes a one-to-one 
mapping of input and output indices. On this basis, methods 
for the mutual transformation of the binary data structures are 
being developed. 

Research Results. An efficient parallel algorithm for construct- 
ing a binary tree based on the address sorting with time com- 


plexity of EO = Ole: N) is obtained. From the well-known 
analogues, the algorithm differs in structure and logarithmic 


estimation of time complexity, which makes it possible to 


achieve the acceleration of ON S15 &21 order analogues. As 
an advanced version, an algorithm modification, which pro- 
vides the maximum parallel construction of the binary tree 
based on a stable address sorting and a priori calculation of the 


stored subtree root indices is suggested. The algorithm differs 


in structure and estimation of 7 = 9) time complexity. A 


similar estimate is achieved in a sequential version of the mod- 
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Beedenue. PazpaOoTaHbI asIrOpHTMBI MapasieIbHOrO MOCTpo- 
CHA JBOHYHOrO JepeBa. AJIFOpHTMBI BbIMOJIHCHbI Ha OCHOBE 
COPTHPOBKH H ONMCaHbI B KOHCTpyKTHBHOH dopme. Jia MHO- 


2KCCTBa H3 N 


T(R) = O(1),, T(R) = Odog, N) 


SJIEMCHTOB BPCMCHHadA CJIOWKHOCTb HMCCT OLCH- 


KH » THe AHCIO Mpolwecco- 


2 
pos ® gh le Se J\epesBo crpoutcsa co CBOMCTBOM eJMH- 
cTBeHHOocTH. AJITOpHTMbI HHBapHaHTHbI OTHOCHTeIIbHO Bua 
BXOQHOH NocmeqoBaTembHocTuH. Llembro paOoTEI s«BIAIaCch 
pa3paOoTKa HM uccleqoBaHHe ciocoboB ycKopeHua mporecca 
opraHv3alHH MW peoOpa30BaHHii ApeBOBHAHBIX CTpyKTyp 
WaHHbIX Ha OCHOBe aJIFOpHTMOB ycCTOW4MBOM MaKCHMaJIbHO 
TlapasWieIbHOM COPTHpOBKH [Id HX MpHMeHeHHA K Oa30BbIM 
olepalluimM HHPOpMalMOHHOrO NONCKa B 6a3ax aHHBIX. 
Mamepuaavi u memodvi. B3auMHO OJHO3HAYHOe COOTBET- 
CTBH€ MHO2KCCTBAa BXOJIHBIX 3JICEMCHTOB HM NOCTpOeHHOrO LA 
Hero J{BOHYHOTO JepeBa ycTaHaBJIMBacTcaA pH TOMO 
YcCTOMYHBOH alpecHol copTupoBKH. CopTuposKa oOsaqaeT 
MaKCHMAaJIbHBIM TapaluieIM3MOM, B olepaTopHoli dopme 
yCTaHaBJIMBaecT B3AMMHO OJHO3HAYHOe COOTBETCTBHe BXOJ- 
HBIX MH BbIXONHBIX MHekcoB. Ha sto ocHoBe pa3padaTEIBa- 
IOTCA MCTOLI B3aHMHOFO MpeoOpa30BaHHA JBOMYHBIX CTpyK- 
TYP aHHBIX. 

Pesyismamei uccaedosanua. Monysen 9eKTHBHBIM Mapal- 
JICIbHbIM asIrOpUTM MOCTpOeHHA JBOHYHOrO epeBa Ha OCHOBE 
ajjpecHoli COpTHpOBKH C  BPeMCHHOM  CJIOXKHOCTBIO 
T(N *) = O(log, N). OT U3BeCTHBIX aHaJIOroB asIrOpHT™M OTJIH- 


yaeTCd CTpyKTypol 4 JorapupMuyecKol OWeHKOM BpeMeHHO!i 
CJIOXHOCTH, NO3BOJMEOWeH BOCTHraTb yCKOpeHHsA aHasloroB 
Topayka O(N"), a 21. B kayecTBe ycoBeplieHCTBOBaHHOro 
BapHaHta TpeqoxeHa MOAMpuKalMA alroputMa, oOecneqH- 
Balollad MaKCHMAaJIbHO MapasWIeIbHOe NOCTpoeHHve BOHYHOrO 
qepepa Ha OCHOBe ycCTOM4MBO alpecHoli COpTHpoBKH U 
allpHOpHOro BEIYHCICHHA XPpaHHMBbIX HHJICKCOB KOpHel MoL- 
qepeppes. AJITOpHTM OTNMYAeTCA CTpyKTypoH wu oOleHKoli 
BpeMeHHOHM cyloKHOcTH 7(1)=O(1). AnanorM4Had oleHkKa 


AOCTHTaeTcA B MOCIICTOBATCJIBHOM BapHaHTe mMoyMdunnpo- 
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ified algorithm, which allows obtaining the acceleration of 


O(N") a>1 


known analogs of order. 


Discussion and Conclusions. The results obtained are focused 
on the creation of effective methods for the dynamic database 
processing. The proposed methods and algorithms can form an 
algorithmic basis for an advanced deterministic search on the 
relational databases and information systems. 
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BaHHOrO aJITOpHTMa, 4YTO HO3BOAeT OCTHTaTb yCKopeHHa 
V3BeCTHBIX aHayoroB MopayKa O(N"), a>1. 

O6cyacoenua u 3akmouenue. [lomyaeHHbie pe3yIbTaTbEl 
HalipaBJICHbI Ha OpraHH3allHio 9—(PeKTHBHBIX CIOCcOOoB AHHa- 
MuyecKol OOpaboTKH 6a3 WaHHEIx. IpeqmoxeHHEIe crocoOnl 
Mf aITOPHTMBI MOTyT COCTaBHTb asIFOpPHTMHYECKYIO OCHOBY IA 
YCKOpeHHOrO JjeTepMHHHPOBaHHOTO TOMCKa B PeJIAWMOHHBIX 


6a3ax JJaHHBIX VMHOpMallMOHHBIX CHCTe€Max. 


KsouesBbie C10Ba: CTIpyKTypbl aHHBbIX, aIrOpHTMbI OOpa- 
OOTKH JaHHBIX, JBOHYHOe JepeBo, ayIrOpHTMBI MapasiesbHOH 
COpTHPOBKH. 


Oopazeu Ona Yumupoeanua: Pom, A. E. Hapannenbyoe 
TlOcTpoeHve BOMYHOTO JepeBa Ha OCHOBe cOpTHpoBKH / 
A. E. Pomm, J]. A. Uadantox // Bectuuk JJon. roc. TexH. yH-Ta. 
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Introduction. There is a tendency to the convergence of parallel information processing technologies and various 
processor architectures in the field of modern high-performance computing. Despite the variety of processor architectures and 
ways of presenting information, the idea of parallel processing is one of the most important tasks of computer science to in- 
crease the data-rate. To accelerate processing speed, the authors propose to use a stable address sorting algorithm with maxi- 
mum parallelism. 


Method of parallel construction of a binary tree. For A= ( G5: Ais.45 35 a4) array, the comparison matrix is developed 
according to [1, 2]. a,, element of this matrix is defined as 


+5. Oe, 
a,, =sign (a,-a,)= 0, a, =a, 


-, a <a 
5 <4; 


> 


where i, j=1, 2, ...,.n. 
a, element in Cale. Cis sy e.) sorted array gets the number k =24,; , where a,,20 ati<j,a,,>0 at 


i> j. All comparisons are mutually independent; the sorting is stable and as parallel as possible with the estimate of 


2 
T (=) = O(1) time complexity. On this basis, you can perform a parallel construction of a binary tree [3, 4]. 


Suppose we are given a set of N elements, all elements of which are represented as a single-dimension array. 
On the set, < ordering relation is assumed. It is required to convert the array into a binary tree. For this, the described 


array sorting is performed. C medial array cell has j,, = =] index and is taken as the root of the tree [3]. All C array 


components to the left of C, form a left subtree (left subarray). The components to the right of C, form a right subtree 


(right subarray). The left subarray is interpreted as a new array. It similarly contains 








i, —l 
5 5(F] i [és root index. Here, C, | is the left-nearest descendant of the root of C, tree. All 


components of the subarray to the left of See do not exceed Ceca all components of the subarray on the right are 








; 1]. 
not less thanC,  . Simultaneously, the root index of 7. ins. = [= )+]3 |=] | ty +] J 5 right subarray 
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is determined. At this, C, | is the nearest descendant of the root of C, tree. The process recursively resumes in 
each pair of the adjacent subarrays: 
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ea 8 SS SSS SSS SSS 


. = J ies vt | 
TiSoess vei 2 ? 
» = # + J oy ce y2it = 
J es siez: 2,2 Tt sees v2 2 ? 


. _: FT ssp: y2i4 “re ST sti. v2 ~ 1 
Sai: v1 S essai. y2it 2 7 





J; aa J ant 
. _ > p. pas. 1/2' cep. mpas. 1/2’ + 
Sg sa: 2,2 S spins y24 ‘| ot i; 2, us| log, N. 





2 


As a result, all components of the lower level of the binary tree are formed in O(1) time. The process can con- 


tinue until log, N exhaustion of the levels of the binary tree. 


The number of algorithm steps for constructing the binary tree in a parallel form is the sum of the sorting step 


and the step sequence when calculating the indices of the roots of subtrees. From here, 
T (R) =log, NT+ T= O(log, N Vs where R is the number of processor elements, tis the time of binary comparison, 
and Tis the time for calculating one root index. R number of processors is determined by the maximum N parallel 
sorting of input elements, and then — through the calculation of indices with doubling by the number of tree levels. 


When calculating the indices, this number will not exceed 2°"”' = N/2, therefore the number of processors involved 


7_N 





; N : : . 
in sorting is sufficient. As a result, R will be less than [3]. Finally, the time complexity of the parallel algo- 


rithm for constructing the binary tree will be 





7(*) = O(log, N). 


Example [3]. The binary tree for an array of 15 elements 
X =(14, 9, 24, 7, 11, 20, 28, 3, 8, 10, 13, 17, 21, 25, 30) is constructed as follows. 
The result of the sort is the array 

C=(3, 7, 8, 9, 10, 11, 13, 14, 17, 20, 21, 24, 25, 28, 30) 


1 
The root of the binary tree is the medial element of C array: j,, -| 3-8. C,=14 The left subarray has 


-1 : ne 
Sea nes. 2 = 4 root, C, =9 element is the root of the left subtree, which is the left-nearest descendant of C, 


: —1 ; . 
medial component. The right subarray has Le sneeie = |S =12 root, C,, =24 element is the root of the right 


: . 4-1 
subtree and the right-nearest descendant of C,, root. Further, Ts genie -| 5 


the subtree on the left and the left-nearest descendant of the root of C, . 





= 2, C,=7 element is the root of 


subtree. In the right subtree, the root has 


1/2 


4-1 : : : ; 
deg vena 4+ S] = 6 number, C, =11 element is the root of the right subtree and right-nearest child of C sie 





Similarly, to the left of C, 


pas. 1/2 


12-8-1 : ; ; 
a =12 -| 5 }10 root is determined, C,, = 20 element is the root of 
p. mpas. 1/4,1 2 10 
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the left subtree from it and the left-nearest descendant of C, __. subtree root. For the subarray, adjacent to the right one 


pan. 1 





12-8-1 : 
discussed above, the root has j,,. as 4,2 124] 5 J number, C,,=28 element is C, | right-nearest 


descendant and the right subtree root. The lower level of the tree will be formed by the descendants remaining to the left 


and to the right of each of four identified roots (Fig. 1): 


ies 











cp 
0 level Cg= 14 
Jep, aes, 1/2 — Jen: pas. 1/2 =12 
1 level 
Jep. nes. 1/4,1 = 2 Jp. sep. 1/4,2 — 6 Jop. npas. 1/4,1 = 10 Jep. npas. 1/4,2 = 14 
2 level 0 
Jop. nes. 1/81 = Jep. new. 8,2 = 3 Jeg, nes. /8,3 => Jep. nea. 1/8,4 = 7 Jey, mpap. 1/8,1 — 9 Joep, nas. 1/8,2 — aI Joep, pas. 1/8,3 13 Joep. npas. 1/8,4 — 15 


Fig. 1. Example of constructing binary tree based on sorting 


There is Theorem 1 [3]. For a single-dimensional array of N components, a binary tree can be built in parallel 


using sorting with T (=) = O(log, N) time complexity. 

The used sorting is stable; the binary tree is implied to be constructed with uniqueness. The indices of all me- 
dial components (all roots of subtrees) can be identified [3]. Considering this modification, all the indices from the 
above example for N subtree values can be calculated synchronously and mutually independently. This leads to a sin- 
gle estimate of the build time of the binary tree. For each specific N , all the values of the tree node indices can be cal- 
culated a priori and stored in the computer memory. With their help, the sorted components can be synchronously and 
mutually independently addressed to all the tree nodes. Formulas for calculating the node indices depend only on the 
total number of N input elements and are in no way dependent on their mutual arrangement after the stable sorting. To 
simplify memory addressing, the computed indices can be ordered at each level and arranged in ascending levels. Then, 
the entire population of the ordered node indices is read from N key. It only remains to arrange the sorted tree elements 
by the read-in addresses. Based on the above, there is 


Theorem 2. For a single-dimensional array of N components, a binary tree can be built in parallel using sorting and 
N? 
prior calculation of indices with T (=) 7 O(1) time complexity. 


The following unified table contains the formal estimates of time complexity of sequential and parallel algo- 


rithms for constructing a binary tree versus the proposed algorithms. 
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Table 1 


Comparative estimates of time complexity of sequential and parallel algorithms for constructing binary tree versus 





















































proposed algorithms 
Accelerati h i 
; . Algorithm time Acceleration when using unit — ies oa mie ee 
Binary tree algorithm : ; ; : logarithmic time-complexity 
complexity time-complexity algorithm ; 
algorithm 
Algorithm of A. ‘ai a. sal ~ uae 
a ze T N# T Ne 
Lagana and V. Kumar F=o[N [5] woe i a: Oo eco = O(NIn2) 
(2004) [5] O82 
Algorithm of P. - 7 7 N? 
Chalermsook (2015) T=O(N’) [6] —=0(N’) =O = O(N? In2) 
[6] T T log, N 
; . Z = : 
Polynomial algorithm i O(N") [7] x : O(N’) L -O N 7 O(N’ In?) 
(2016) [7] T T log, N 
“Left child — right 7 nN? 
sibling” algorithm T =O(N’) [8] —=0(N") 7 Ge w TOW 2) 
(2014) [8] f 08: 
Pattern-based T= O(|D| log, D) T 6 |D\log, D Tr a |D|log, D 
algorithm (1991) [9] [9] T 1 T log, N 
The presented 
algorithm with 
logarithmic estimate T= O(log, N ) [3] — a 
of time complexity 
(2015) [3] 
The presented 
algorithm with single T=0 (1) 7 _ 
estimate of time 
complexity (2015) [3] 

















In Table 1: D is capacity of the template dictionary, N is the number of input elements of the binary tree, k 


is the dimension of the space in which sorting is performed. 


The table shows that the proposed algorithm with a logarithmic estimate of time complexity abstractly im- 
proves estimates of the known algorithms. Minimum acceleration is achieved with respect to the algorithm from [5]: 


T 


= = O(N . In2) ,or —= O(N i and maximum acceleration is achieved relative to the polynomial algorithm from [7]: 


Z 
P 
. 





3 
, = of; al 5] or 7 = (N ‘) . Regarding the proposed algorithm with a single estimate of time-complexity, the eval- 
0g, 


uation of the known algorithms also improves. In this case, minimum acceleration is achieved relative to the algorithm 


from [5]: = =O(N ) , and maximum acceleration is achieved with respect to the polynomial algorithm from [7]: 


r : 
AON’). 


Conclusion. The developed algorithms differ from the known techniques [5—7, 10, 11] of constructing a binary 
tree in that they use maximum parallel sorting to calculate the indices of the nodes. In this case, either a logarithmic 
number of steps is consumed by building a tree, or additional time is not spent at all, if the values of the indices are a 
priori calculated for all N values in some real boundaries and stored in the computer memory. The proposed parallel 
algorithm for constructing a binary tree can be used to organize efficient methods for dynamic processing of databases. 
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